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The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: ' ^ 

. Application Serial Number: 

Source: 

v y 

Date Processed by STIC: 1\)3q1o»' J 



THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. / 
PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY^ 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 

FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-308-4216. 
PATENTIN 2.1 c-mai! help: Datin21help@usDto.ebv or phone 703-306-4119 (R. Wax) 
PATENTIN 3.0 e-mail help: nattn3help@usDto.gov or phone 703-306-4119 (R. Wax) 

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 3.0 PROGRAM . ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW: 



Checker Version 3.0 

The Checker Version 3.0 application is a state-of the-art Windows based software program 
employing a logical and intuitive user-interface to check whether a sequence listing is in 
compliance with format and content rules. Checker Version 3.0 works for sequence listings 
generated for the original version of 37 CFR §§1.821 - 1.825 effective October 1, 1990 (old 
rules) and the revised version (new rules) effective July 1, 1998 as well as World Intellectual 
Property Organization (WIPO) Standard ST.25. 

Checker Version 3.0 replaces the previous DOS-based version of Checker, and is Y2K- 
compliant. Checker allows public users to check sequence listings in Computer Readable form \ 
(CRF) before submitting them to the United States Patent and Trademark Office (USPTO). j 
Use of Checker prior to filing the sequence listing is expected to result in fewer errored sequence j 
• listings, thus saving time and money. 

: Checker Version 3.0 can be down loaded from the USPTO website at the following address: 

http://wvm.uspto.gov/web/offices/pac/checker 



Raw Sequence Listing Error Summary 



frbAR nKTECTED ^nn.vQTvn CORRECTION 



SERIAL NUMBER: 4>%1 



ATTN - NEW RULES CASES: PLEASE DISREGARD ENCLISII "ALPHA" HEADERS, WHICH WERE INSERTED DY gTO SOFTWARE 

prevent "wrapping." 

2 Invalid Line Le.fh.Tbe rules require that a .ine no, eicccd 72 characters in length. Tlus includes iMejpce* 

3 ^Misaligned Amino Tke numbering under each 5* amino acid is misaligned. Do no, use tab codes beU^bers; 

Numbering use space characters, instead. 

4 No.ASCII Be subbed fde was no, saved in ASCI.(DOS) text £ g-"* * *« ^uence Rules. P.casc 
en , urc your subsequent submission is saved in ASCII ,«t 

,_**>* Length ^J-""-* £^.'21^^ 

scqucnces(s) ■ normally, r««-iuu ^ ^ , rttrtUfinl ^>-)n>.<71^> section t< 



Patcntln 2.0 
"bug" 



" , v Norma v ralcniLn wouiu auiunwuwm; & *...w-v 

Artificial or Unknown sequences. 

' . lrint^minml nlcasc insert the following lines for each skipped sequence: 

_Skippcd Sequences HtT^sraSSSSlD NO where "X" is shown) 
(OLD RULES) (2 ^^TKcE^iS^tcSsT CS: (Do no, insert any subheadings under this hcadmg) 
(5!) sequSSJJSSSSS ID NO:X: (insert SEQ ID NO where "X" is shown) 
This sequence is intentionally skipped 

Please also adjust the «(ii) NUMBER OF SEQUENCES:" response to include (he skipped sequences. 

Sk^d Sequences Sequence*) missing. If intentional, please .nscrt the fol.ow.ng lines for each skipped sequence. 

(NEW RULES) <2 1 0> sequence id number 

<400> sequence id number 

000 

Use of „•■ or Xaa's Use of n's and/or Xaa's have bee* £ggl » ^«5^R Y if or Xaa * s « prescn t. 
(NEW RULES) which residue n or Xaa presents. 



10 Invalid <213> 

Response 

11 Use of<220> 



12 Patcntln 2.0 

"bug** 



is Artificial Sequence 

AMC - Biotechnology Systems Branch - 06704/2001 



OIPE 



RAW SEQUENCE LISTING DATE: 11/30/2001 

PATENT APPLICATION: US/09/785,689 TIME: 12:17:09 

Input Set : A:\Seq.lst 

Output Set: N:\CRF3\11212001\I785689.raw DOGS Not Comply 

SEQUENCE LISTING Corrected Diskette Neerfow 

4 (1) GENERAL INFORMATION: ^ ~ UWQ 

6 (i) APPLICANT: Lindler, Luther E OrK^p. 

7 • Warren , Richard 

8 VanDeBerg, Lillian 

9 Rubin, Fran 

11 (ii) TITLE OF INVENTION: Protein From Brucella "Stfecies 

13 (iii) NUMBER OF SEQUENCES: 2 

15 (iv) CORRESPONDENCE ADDRESS: 

16 (A) ADDRESSEE: Hendricks and Associates 

17 (B) STREET: P.O. Box 2509 

18 (C) CITY: Fairfax 

19 (D) STATE: Virginia 

20 (E) COUNTRY: United States 

21 (F) ZIP: 22031 

23 (v) COMPUTER READABLE FORM: 

24 (A) MEDIUM TYPE: FlSppy disk 

2 5 ( B ) ..COMPUTER*: IBM PC compatible 

2 6 (C) OPERATING SYSTEM: PC -DOS/MS -DOS 

27 (D) SOFTWARE: Patentln Release #1.0, Version #1.25 

29 (vi) CURRENT APPLICATION DATA: 

C--> 30 (A) APPLICATION NUMBER: US/09/785,68? 

C--> 31 (B) FILING DATE: 20-Feb-2001 

32 (C) CLASSIFICATION: 

34 (viii) ATTORNEY/AGENT INFORMATION : 

35 (A) NAME: Hendricks, Glenna 

36 (B) REGISTRATION NUMBER: 32,535 

37 (C) REFERENCE/DOCKET NUMBER: lindler 

39 (ix) TELECOMMUNICATION INFORMATION: 

40 (A) TELEPHONE: (703) 425-8405 

41 (B) TELEFAX: (703) 425-8406 

ERRORED SEQUENCES 

44 (2) INFORMATION FOR SEQ ID NO : 1: 

4 6 (i) SEQUENCE CHARACTERISTICS: 

47 (A) LENGTH: 1025 base pairs 

4 8 (B) TYPE: nucleic acid 

4 9 (C) STRANDEDNESS : single 

50 (D) TOPOLOGY: unknown 

52 (ii) MOLECULE TYPE: DNA (genomic) 

54 (iii) HYPOTHETICAL: NO 

56 (iv) ANTI- SENSE: NO 

58 (vi) ORIGINAL SOURCE: 

59 (A) ORGANISM: Brucella 

62 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
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RAW SEQUENCE LISTING 
PATENT APPLICATION: US/09/785,689 

Input Set : A:\Seq.lst 

Output Set: N:\CRF3\11212001\I785689.raw 



DATE: 11/30/2001 
TIME: 12:17:09 



E- 
W- 
E- 
W- 
E- 
W- 
E- 
W- 
E- 
W- 
E- 
W- 
E- 
W- 
E- 
W- 
E- 
W- 
E- 
W- 
E- 
W- 
E- 
W- 
E- 
W- 
E- 
W- 
E-- 
W-- 
E-- 
W-- 
E-- 
W-- 
E-- 
W-- 



-> 64 CCCCTGACAT AACCCGCTTT GTCCAAATTT TTTCAACTTT 
-> 65 60 

-> 67 ACACTCGTGC TAGCAATTTT CTCGCAGCCT CATTTTCCAC 
-> 68 120 

-> 70 TCAGCCTGCC CGCTTTCGCA CAGGAGAATC AGATGACGAC 
-> 71 180 

-> 73 TCACCGGGGA AGGCXTGATG ACGGCCTCGC CCGATATGGC 
-> 74 240 

-> 76 TACGCCAGGC AAAGACCGCG CGCGAAGCCA TGACCGCGAA 
-> 77 300 

-> 79 TGCTCGATGC CATGAAGAAG GCCGGCATCG AAGATCGCGA 
-> 80 360 

-> 82 ATATCCAGCC GATTTATGTC TATCCTGACG ACAAGAACAA 
-> 83 420 

-> 85 CCGGCTATTC TGTATCCACC AGTCTCACGG TTCGCGTGCG 
-> 86 480 

-> 88 AAATTTTGGA TGAATCCGTC ACGCTCGGTG TTAATCAGGG 
-> 89 540 

-> 91 ATGATAATCC CTCCGCCGTG ATCAACGAGG CGCGCAAGCG 
600 

CCAAGGCGAA GACGCTTGCC GACGCXlgCAG GCGTGGGGCT 
660 

GTGAACTGAG CCGCCCGCCC ATGCCGATGC CAATTGCGCG 
720 

TAGCAGCCGC ACCGGACAAT TCCGTGCCGA TTGCCGCAGG 
780 

CGGTCAATGT CGTTTTTGAA ATCAAGTAAA TAGCTGGGGT 
840 

ATACAAAACG CCGGCCTGGT TTCACAGGCC GGTTTTTTTG 
900 

GATTGAATCC GATCGGCGCT CTAATCCTTT GTTTTGACGC 



92 
94 
95 
97 
98 
100 
101 
103 
104 
106 
107 
109 
110 
112 
113 
115 
116 
118 
120 
121 
122 
123 
124 
126 
128 
130 
132 
134 
135 
136 



960 

TTCACACTTT TCGGGATGCG GTCTAGCGGA TGATCGGGCA 

1020 
TCACG 

1025 

(2) INFORMATION FOR SEQ ID NO: 2: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 250 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(v) FRAGMENT TYPE: internal 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Brucella 

(B) STRAIN: unknown 



TCCTGTAGGA GATTTTATGA 
AATCATGCTC GTCGGCGCTT 
GCAGCCCGCG CGCATCGCCG 
CATTCTCAAT CTCTCGGTGC 
TAATGAAGCC ATGACAAAAG 
TCTCC£GA£A GGCGGCATCA 
CCTGAAAGAG CCTACCATCA 
CGAACTGGCC AATGTTGGAA 
CGGTGATTTG AACCTGGTCA 
CGCAGTGGCC AATGCCATTG 
TGGCCGTGTG GTGGAAATCA 
CGGACAGTTC AGAACCATGC 
CGAAAACAGC TATAACGTAT 
ATGACGCCCT TTGCCACCTG 
ATTAGAGCGC GTTTCGATCT 
GCATCTTTTC CGAAAACCGT 
ACCGCGCGTA TCGGCAAATG 
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RAW SEQUENCE LISTING DATE : 11/30/2001 

PATENT APPLICATION : US/09/7$5, 689 TIME: 12:17:09 

Input Set : A:\Seq.lst 

Output Set: N:\CRF3\11212001\I785689.raw 



15 







139 




(Xi) 


SEQUENCE DESCRIPTION: SEQ ID NO 


2: 




Ser 






141 




Met 


Asn 


Thr 


Arg 


Ala. 


Ser j&sn 


Phe 


Leu 


Ala 


Ala 


E- 


-> 


142 


He 


























E- 


-> 


14 3 




1 








5 










10 




Ala 






146 




Met 


Leu 


Val 


Gly 


Ala 


Phe 


Ser 


Leu 


Pro 


Ala 


Phe 


E- 


-> 


14 7 


Gin 


























E- 


-> 


148 










20 










25 












151 




Met 


Thr 


Thr 


Gin 


Pro 


Ala 


Arg 


He 


Ala 


Val 


Thr Gly 


E- 


-> 


152 


Met 


























E- 


-> 


153 








35 










40 














156 




Thr 


Ala 


Ser 


Pro 


Asp 


Met 


Ala 


He 


Leu 


Asn 


Leu §er 


E- 


-> 


157 


Gin 


























E- 


-> 


158 






50 










55 










60 






161 




Ala 


Lys 


Thr 


Ala 


Arg 


Glu 


Ala 


Met 


Thr 


Ala 


Asn 


Asn 


E- 


-> 


162 


Thr 


























E- 


-> 


163 




65 










70 










75 








166 




Lys 


Val 


Leu 


Asp 


Ala 


Met 


Lys 


Lys 


Ala 


Gly 


He 


Glu 


E- 


-> 


167 


Leu 


























E- 


-> 


168 












85 










90 










171 




Gin 


Thr 


Gly 


Gly 


lie 


Asn 


He 


Gin 


Pro 


He 


Tyr 


Val 


E- 


-> 


172 


Asp 
























E- 


-> 


173 










100, 










105 












176 




Lys 


Asn* 


Asn 


Leu 


Lys 


Glu 


-Pro 


Thr 


He 


Thr 


Gly 


Tyr 


E- 


-> 


177 


Thr 


























E- 


-> 


178 








115 










120 














181 




Ser 


Leu 


Thr 


Val 


Arg 


Val 


Arg 


Glu 


Leu 


Ala 


Asn 


Val 


E- 


-> 


182 


Leu 


























E- 


-> 


183 






130 










135 










14 0 






186 




Asp 


Glu 


Ser 


Val 


Thr 


Leu 


Gly 


Val 


Asn 


Gin 


Gly 


Gly 


E- 


-> 


187 


Leu 


























E- 


-> 


188 




145 










150 










155 




E- 


-> 


189 


160 






























191 




Val 


Asn 


Asp 


Asn 


Pro 


Ser 


Ala 


Val 


He 


Asn 


Glu 


Ala 


E- 


-> 


192 


Ala 


























E- 


-> 


193 












165 










170 










196 




Val 


Ala 


Asn 


Ala 


He 


Ala 


Lys 


Ala 


Lys 


Thr 


Leu 


Ala 


E- 


-> 


197 


Gly 


























E- 


-> 


198 










180 










185 












201 




Val 


Gly 


Leu 


Gly 


Arg 


Val 


Val 


Glu 


He 


Ser 


Glu 


Leu 


E- 


-> 


202 


Pro 


























E- 


-> 


203 








195 










200 














206 




Met 


Pro 


Met 


Pro 


He 


Ala 


Arg 


Gly 


Gin 


Phe 


Arg 


Thr 


E- 


-> 


207 


Ala 


























E- 


-> 


208 






210 










215 










220 






211 




Ala 


Pro 


Asp 


Asn 


Ser 


Val 


Pro 


He 


Ala 


Ala 


Gly 


Glu 


E- 


-> 


212 


Asn 


























E- 


-> 


213 




225 










230 










235 




E- 


-> 


214 


240 






























216 




Val 


Ser 


Val 


Asn 


Val 


Val 


Phe 


Glu 


He 


Lys 







30 




45 



80 



95 



110 



125 



175 



190 



205 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/785, 689 



DATE: 11/30/2001 
TIME: 12:17:09 



-ioput_Set_: A:\Seg.lst 
Output SetriiT\CRm^4.ai2*L^^ 



245 



250 
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VERIFICATION SUMMARY DATE: 11/30/2001 

PATENT APPLICATION: US/09/7.85 f 689 TIME: 12:17:10 

Input Set : A:\Seq.lst 

Output Set: N:\CRF3\11212001\I785689.raw 

L:30 M:220 C: Keyword misspelled or invalid format, [(A) APPLICATION NUMBER : ] 
L:31 M:220 C: Keyword misspelled or. invalid format, [(B) FILING DATE:] * 
L:64 M:254 E: No. of Bases conflict, Input :0 Counted: 60 SEQ:1 
L:65 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID-1 
M:254 Repeated in SeqNo=l 

L:68 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1 
L:71 M:336 W: Invalid Amiiio Acid Number in Coding Region, SEQ ID:1 
L:74 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1 
L:77 M:336 W:* Invalid Amino Acid Number in Coding Region, SEQ ID:1 
L:80 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1 
L:83 M:336 W: Invalid Amino Acid Number in Coding Region, jSEp ID:1 
L:86 M:336 W: Invalid Amino Acid Number in Coding Region, "sBQ ID:1 
L:89 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1 
L:92 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1 
L:95 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1 
L:98 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1 
L:101 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1 
L:104 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1 
L:107 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1 
L:110 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1 
L:113 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1 
L:116 M:336 W: Invalid Amino ^cid Number in Coding Region, SEQ ID:1 
L:142 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID-0 
M:332 Repeated in SeqNo=2 
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